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We describe how to use propositional model counting for a quantitative analysis of product config- 
uration data. Our approach computes valuable meta information such as the total number of valid 
configurations or the relative frequency of components. This information can be used to assess the 
severity of documentation errors or to measure documentation quality. As an application example 
we show how we apply these methods to product documentation formulas of the Mercedes-Benz line 
of vehicles. In order to process these large formulas we developed and implemented a new model 
counter for non-CNF formulas. Our model counter can process formulas, whose CNF representations 
could not be processed up till now. 

1 Introduction 

Since Rl/XCON [15] was used by DEC to support computer system configuration and assembly, product 
configuration systems have been among the most prominent and successful applications of AI methods 
in practice [19]. As a result computer aided configuration systems have been used in managing complex 
software, hardware or network settings. Another application area of these configuration systems is the 
automotive industry. Here they helped to realize the transition from the mass production paradigm to 
present-day mass customization. 

Model counting is a technique to count the number of different satisfying variable assignments of 
a formula in propositional logic. Up to now model counting has mostly been used within bayesian 
networks [22, 1] and planning problems [17, 14]. In contrast, our work aims at using propositional 
model counting for a quantitative analysis of comprehensive product configuration setups. 

To demonstrate the applicability of our novel methods, we provide examples within a well-studied 
car configuration context. The German automobile industry follows a build-to-order strategy based on an 
exceedingly large product variety. This makes it possible, especially for the manufacturers of luxury cars, 
to offer each customer their unique tailor-made car, and it differentiates the model lines from the mass 
market. Pil and Holweg [18] have discussed the interconnection of product variety and order-fulfillment 
strategies. 

In this context it is interesting for the management to know precisely how much variety there is in the 
product line. For the automobile industry, this question is surprisingly difficult to answer. In particular, 
sales options cannot simply be multiplied out because of complex interdependencies (e.g. automatic 
transmission being standard on U.S. exports, and not available on European cars with small engines). Pil 
and Holweg published data for the year 2002, based on an analysis of company material, about customer 
selectable variations. In summary, numbers range from the order of 10 3 for the smallest Peugeot and 
Nissan models to the order of 10 8 or 10 9 for the GM Astra and Corsa or the Ford Focus. In the luxury 
sector, the BMW 3-Series was estimated at 6 ■ 10 16 , and the Mercedes C-Class at 10 21 . Finally, for the 
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Mercedes E-Class, 2 body styles, 15 power trains, 285 paint-and-trim options, and 70 other factory-fitted 
options could be combined to variations on the order of 3 • 10 24 . It gets even worse: Variations induced by 
sales to countries outside Europe (due to different emission standards, fuel grades etc.) apparently were 
not considered by Pil and Holweg. And customer visible variations are only a subset of the variations 
afforded by technical and legal restrictions. 

New technology options, such as hybridization, are a continuous source of new variability, and there- 
fore the issue of rising complexity must have the ongoing attention of management. Since we work with 
our industrial partner Mercedes-Benz on the symbolic verification of configuration problems [13], such 
as whether a given configuration is valid or not, or which part is required in which configurations, we 
were able to investigate how many valid configurations there are for one specific car line. Obviously, 
there are a number of useful questions associated with this number. Some of these are: How does the 
complexity evolve over time? Which change in complexity would result from adding or canceling an 
equipment option? What is the complexity of each body style? Of each engine or power train configura- 
tion? In each country on Earth? For which percentage of the total variations is a particular part needed? 
In practice, detailed questions such as the above can only be answered precisely if the number of varia- 
tions can be computed automatically from the manufacturer's configuration data. This was not possible 
up to now, which also means that the above numbers were not verified independently so far. 

This paper covers only the first steps towards using model counting in configuration and manufac- 
turing problems. Clearly, some kind of additional information about the expected or observed frequency 
of chosen options in the real product should be included in the model counts. E.g. an option can be 
allowed in 90% of all valid configurations, but is chosen only in 10% of actual orders. Still, to the best 
of our knowledge our results represent the first cases where exact model counts were obtained directly 
and automatically from the internal manufacturer configuration data of a current highly complex product 
line. We are certain that more ways will be found in which our methods can be used for computing 
management information. Also, more extensive data sets will certainly be produced in the future. 

The plan of this paper as follows: In Section 2 we summarize the theory of model counting and give 
a short overview of current algorithmic approaches. Section 3 presents possible application fields for 
model counting in configuration problems. In Section 4 we discuss the specific application of our new 
methods in the customer order process at Mercedes-Benz. We give a short description of our new non- 
CNF model counter ncnt and present benchmarks of configuration problems used in the current product 
line. Section 5 finally summarizes our contributions and points at some future research directions. 

2 Model Counting 

Let (p be a formula in propositional logic, and let vars(<p) denote the finite set of variables occurring in 
the formula (p. An assignment a for <p is a partial function a : vars(<p) — > {T,±} mapping variables 
to truth values T and _L. We follow the convention to write a \= (p when some formula (p holds with 
respect to a. Propositional model counting or #SAT is the problem of computing the number of all full 
assignments a for which (p holds, i.e. \{a \ a \= (p}\. 

Analogous to SAT, which is the canonical NP-complete problem, #SAT is the canonical #P- complete 
problem. The complexity class #P is the class of all problems p for which there exists a non-deterministic 
polynomial-time bound Turing machine M(p) such that for each instance I(p) of p there exist exactly as 
many computation paths of M{p) as solutions for I(p). Intuitively #P is the class of counting problems 
for polynomial-time decidable problems. According to [25] even the counting problems for polynomial- 
time solvable problems like 2-SAT, Horn-SAT, or DNF-SAT can be #P-complete. 
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In this paper we only deal with exact model counting (in contrast to approximative counting). We 
distinguish between two different approaches for exact counting: (1) DPLL-like exhaustive search and 
(2) knowledge compilation. 

The vast majority of successful SAT solvers uses the DPLL approach [10, 9]. DPLL is basically 
a complete search in the search space of all 2" variable assignments with early cuts in the search tree 
when an unsatisfiable branch is detected. DPLL-style model counters like CDP [4], RelSat [2], or 
Cachet [20] are extensions to existing SAT solvers and require an input formula in CNF. If a formula <p 
with n variables is not satisfiable, the output is 0. If a satisfying (and possible partial) assignment a is 
found, the number of models for this a is computed with 2"~l a l and the algorithm proceeds to explore 
the rest of the search tree. There are two important improvements of this DPLL-based approach. The 
first one is component analysis [2] where one identifies different components C\ , . . . , C n in the constraint 
graph G of a CNF formula (p. Let <p 1; ..., (p n be the sub-formulas of q> corresponding to the components 
C\, . . . ,C„. Then the model count #SAT(q>) is equal to #SAT((pi) x • • • x #SAT((p n ), thus we can calculate 
the model count of each component independently. This identification of components can be performed 
dynamically while descending into the search tree. The second improvement is the #SAT correspondence 
to clause learning in SAT: component caching [20, 24]. Since during the counting process we often 
compute counts for the same sub-formulas multiple times, we cache signatures of sub-formulas and their 
model count according to certain caching schemes. Variable selection heuristics as known from SAT 
have to be adjusted for #SAT wrt. component analysis and caching: While in SAT one tries to narrow 
down the search to one specific solution by intelligently choosing the branching variables, in #SAT we 
try to choose variables where the according constraint graph is decomposed in various components [21]. 

In the knowledge compilation based approach we convert the formula <p into another logical rep- 
resentation such that #SAT(cp) can be computed in polynomial time. One well known approach for 
this is the compilation of <p into a binary decision diagram (BDD) [5]. Once we have the BDD, we 
can count all paths from the root node to the T labeled node to get the model count of the formula at 
hand. Narodytska and Walsh discussed this approach for configuration problems [16]. However our 
own experiments with the formulas presented in Section 4 showed that for these large formulas, which 
emerge in our industrial automotive project, the compilation into BDDs is not feasible. Another logical 
representation for prepositional formulas, which is used e.g. in the system c2d [8], is the deterministic 
decomposable negation normal form (d-DNNF) [7]. A DNNF is an extension of a negation normal form 
where for each conjunction A?=o/i w i tn sub-formulas fo,... ,f n it must hold that vars(/)) n vars(/ ; ) = 
for all < i < j < n. In a d-DNNF we have the additional condition that for each disjunction \f" =0 gi 
with sub-formulas G = {go, ... ,g n } it must hold that for each a we have \{gj £ G \ a \= gi}\ < I, i.e. no 
sub-formulas share the same satisfying assignment. Once we have the d-DNNF <p' of a formula (p we 
can count all models by these two rules: 

#SAT(f\fi) = f\#SAT(fi) , #SAT(\f gi ) = f j #SAT(g i ). 

!=0 '=0 1=0 i'=0 

It turns out that a d-DNNF representation is closer to the original CNF formula and therefore is easier to 
compute. 

3 Applications in Product Configuration 

In this section we will point out possible fields of applications for model counting in product configu- 
ration. We use the definition of a configuration problem as given in [11, Definition 1]: a configuration 
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problem is a triple {Y ,D,m) where "V is a set of variables x\,X2,. ■ ■ ,x n , Dis a set of their finite domains 
D\,D2,..,D n and *P = {l^i, 1/A2, . . . , yA m } is a set of prepositional formulas over atomic propositions 
xt = v where v G D,, specifying conditions that the variable assignments have to satisfy. For each formula 
i^G'Pwe have vars(i//) C "V . A valid configuration is an assignment a with dom(a) = "V such that 

In this paper we consider the special case where we have only prepositional variables in "V and 
hence D, = {T, _L} for all 1 < i < n. The set & is the finite set of all configuration options for a product. 
Each variable x € "V represents a configuration option o £ 0. The variable x is assigned to T if 
the option o is chosen, otherwise it is assigned to _L. Following this course, the resulting formulas 
Y £ *P are prepositional formulas and hence <p = Awe* ¥ * s a prepositional formula describing all valid 
configurations. We will also refer to q> as product overview formula (POF) [13]. 

Remark. The restriction of the variables x € ^ to prepositional variables does not limit the ex- 
pressiveness of our problem description. Since the domains D, are finite and we only allow atomic 
propositions of the form x = v, we can use a reduction [6] from equality logic to prepositional logic. 

3.1 Number of Valid Configurations 

The first question which naturally arises is the total number of valid product configurations v = #SAT((p). 
Obviously 2^ I is an upper bound for v, but in most cases v <C 2^. Nevertheless this number v can often 
demonstrate the sheer complexity of a given product. Subsection 4.2 supports this claim by presenting 
some of these numbers for the car lines of Mercedes-Benz. 

One can also count valid configurations of a product under certain preconditions. E.g. we can force 
options P C G to be chosen before performing #SAT((p). This can be achieved by computing #SAT((p A 
f\ peP x p ). This method can yield important information about the influence of certain options o on the 
number of valid configurations. This information again can be used for special domain-specific variable 
heuristics for the SAT solving process of formulas of the application area at hand. The main idea of 
state-of-the-art SAT solvers is to narrow down the search space as fast as possible. Thus when we know 
a set of variables XCf representing configuration options P C & which reduce the number of models 
of a given formula to a great extent, we can give these variables x GX high activity in the SAT solving 
process of formulas representing configuration problems of the same product line. 

Preselecting a certain option can also be used to compute the relative frequency of this option in valid 
configurations. We compute the frequency f(o) of a given option o with 

#SAT((pAx t) ) 
J[0) #SAT((p) ' 

f(o) can be used as additional information to statistical data about the frequency of options for demand 
estimation or process optimization. 

3.2 Rating Errors 

Quite important issues arise when reporting errors. Observations from formal methods in software veri- 
fication [3] tell us that the more bugs you report, the smaller the probability gets that they will eventually 
be fixed. Developers as well as product documentation engineers tend to get overwhelmed quite quickly 
by extensive error reports leaving them uncertain where to start correcting defects. 

Model counting might help classifying errors according to their severity. We consider scenarios 
in which satisfiability of the input formula <p indicates error situations — hence any satisfying assign- 
ment may be interpreted as a counterexample. In configuration problems such situations mostly arise 
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when checking mutually exclusive component inclusion. Let c\,C2 G vars(<p) denote binary flags for 
the inclusion of two mutually exclusive components. Each assignment satisfying (p /\c\ AC2 may be 
interpreted as a product featuring both mutually exclusive components at the same time, thus computing 
#SAT((p Aci AC2) results in the total number of invalid configurations w.r.t. c\,c%. Formulas yielding 
many such "counterexamples" intuitively tend to fail more likely in practice than formulas yielding only 
negligible numbers. However, if there are very few models for a given error, we assume that this error is 
more intricate than others. 

Measuring error severity using model counters as proposed may however turn out not to be applicable 
in every domain. Think of Boolean encodings of configuration options where each option o is assigned 
to a value of a finite integer domain D and a consistency assertion a(o) fixing o to a specific integer 
value i. If checking the Boolean encoding w.r.t. a(o) yields that o is not restricted to i but may instead 
take an arbitrary value of D, model counting will consequently return some large number of invalid 
configurations. In case o is some exotic configuration option, using the computed number as a measure 
of severity might be misleading. 

3.3 Measuring Documentation Quality 

Often the individual constraints i(/£f are reflected in a product documentation. This product docu- 
mentation is either automatically produced or manually maintained by experts. For complex products 
with thousands of options this documentation evolves over years. This fact can be observed in the devel- 
opment of the numbers of valid configurations of different versions of the documentation for the same 
product. We made observations where for the same product the number of valid configurations went 
down from a magnitude of 10 34 to a magnitude of 10 10 due to a better and more detailed documentation 
(c.f. Subsection 4.2). 

Model counting can also be used interactively by the documentation engineer. One can immediately 
see the impact of adding or changing certain constraints in terms of valid configurations. E.g. when 
adding a new constraint does not change the number of valid configurations it can be redundant and 
therefore can be omitted. 



4 Example: Automotive Product Configuration 

In this section we give examples how to apply prepositional model counting to formulas as used in 
constructibility checking of customer orders at Mercedes-Benz. Therefore we first give a brief introduc- 
tion of the notions used in the product documentation and of the configuration system used in the mass 
production of individually configured, personalized cars. 

4.1 Formula Semantics 

For the Mercedes lines of cars, product documentation is done based on prepositional logic. For each 
vehicle class, there are about 1.500 variables which represent the configuration options. Processing of 
customer orders at Mercedes-Benz is organized as a three-staged process [13]: 

1. The customer's choice of configuration options P C G with P = {x 0i ,... ,x , p , } is compiled into an 
initial assignment a = {x 0l <— T, . . . ,x , p , <— T}. We refer to prepositional variables representing 
an option o € G also as codes. 
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2. The assignment a computed in Step 1 is iteratively extended using so called supplementary rules. 
A supplementary rule S(x) — > x consists of a conditional part S(x) in prepositional logic and a 
supplementary code x. If a \= S(x) for the current assignment a we extend a = a U {x <— T}. 

3. For each code x there is a constructibility condition in prepositional logic of the form x — > C(x). 
The initial assignment a of Step 1, supplemented in Step 2, is finally checked for constructibility 
in this third step. If a \= Axedom(a)C( x ) holds, the customer order is constructible otherwise it will 
be refused. 

Let X be the set of all allowed codes for a given line of vehicles, then the afore mentioned POF is 
defined as follows 

POF(X) = f\ (S(x) -»• *) A (x -> C{x)). 
xex 

Example. Consider as a toy example a vehicle where the configuration options are three different engines 
with codes e\, ei, e$, two different gearboxes g\, g2 and three additional features a\, ai, a^. Engine e\ 
must be combined with gearbox g\ (sri), e2 must be combined with g2 (sr2). In a car with e^ and gi also 
a% has to be chosen (sr3), in a car with £3 and g2, we must choose a 3 (sr/j). The resulting supplementary 
rules are 

sri=ei->-gi, sr 2 = e 2 ^-g2, sr 3 = e 3 Agi ->• a 2 , sr 4 = e 3 Ag 2 -> 03- 

There is exactly one engine in a vehicle (CC1-CC4) and exactly one gearbox (cc 5 -cc 7 ). There is also the 
condition, that feature 02 must not be combined with 03 (ccs). The resulting constructibility conditions 
are 

cci = T — > e\ Ve2V^3, CC2 = e\ — >• ->e% A->e^, CC3 = e2 -^ -&\ A -1^3, CC4 = e^ — > -*ei A ~^e% 

cc 5 = T->gi Vg 2 , cc 6 =,?i ->->g2, ocj=g2^--'gi, cc 8 = a 2 ^^a 3 
The POF is the conjunction of all constructibility conditions and supplementary rules: 

8 4 

POF=/\cc ! A/\sr i . 
1=1 y=l 

The solutions of the POF represent exactly those customer orders (as a set of configuration options) 
which can be built under the configuration constraints. Hence any model a of the POF (i.e. a \= POF) 
describes a single valid configuration. Based on the notion of the POF, several consistency tests such 
as finding necessary codes, detecting redundant parts or intersections may be performed quite naturally 
using SAT solvers [13]. 

Given the variability described above, it is no longer possible to store separate parts lists (bills of 
material - BOM) for each possible order. Therefore, there is a single BOM for the model line (e.g. E- 
Class), where each part p is associated with a selection condition P(p) in prepositional logic. For each 
customer order, given as the supplemented assignment a, every part whose selection condition evaluates 
to true under a is pulled into the order specific BOM. Hence the set of all parts p occurring in the BOM 
is defined as {p \ a \= P(p)}. 

The BOM is actually structured into so-called positions, each of them grouping together a set of parts 
which may be alternatively selected for installation at the same geometric position of the car (e.g. the 
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set of available radios). In addition, there is one fictitious (null-)part which represents the '"nothing-to- 
install'" case. Counting the models of the selection formulas for each part may give an indication of the 
relative frequency of the parts, and of the frequency of the null-case, which often represents some kind 
of exception. 

4.2 Model Counting in the Mercedes-Benz Scenario 

In the rest of this paper, we will present some results as described in Section 3 gathered by model 
counting product documentation formulas in use at Mercedes-Benz. As those formulas are originally 
non-CNF, we developed a new exact propositional model counter that operates directly upon the raw 
formulas without prior normalization. It is based upon a reimplementation of the non-CNF SAT solver 
introduced by Thiffault et al. [23] and uses the DPLL-style approach to model counting with connected 
component detection and caching. 

Remark. When converting a non-CNF formula (p into a CNF formula <p' in order to utilize a CNF 
model counter, it is important to use a transformation method which preserves the number of models 
of <p, i.e. \{a | a \= (p}\ = \{a \ a \= q>'}\. Consider e.g. the original formula <p = a V (b Ac). To get 
an equisatisfiable formula <p' in CNF we can use the Tseitin transformation with either implications 
(T^(cp)) or equivalences (T^((p)): 

T^{<p)= for A (f or ^a\J f and ) A (f and ^bAc) = f or A (-./ , V a V f and ) A (-.fand V b)(-.f an d V c) 

T^((p) = f or A (f )r O a V fand) A (fand ^ b Ac) = f or A (-.for V a V fand) A (for V ^o) A (for V ^fand)t\ 

hfand Vb)A (-.fand V c) A (f m d V ~b V -<c) 

We can now compare the number of models for each formula: #SAT((p) = 5, #SAT (T~* (<p)) = 6, and 
#SAT(T^((p)) = 5. We see that only T^((p) preserves the number of models. 

4.2.1 Number of Valid Configurations 

Model counting pure PO formulas without any extensions (i.e. computing #SAT(POF), the number of 
different valid orders) demonstrates the vast multitude of constructible vehicles due to customer choices: 
We have been able to compute results of 5.9 x 10 10 up to 9.9 x 10 103 constructible orders for the E-class 
line of Mercedes-Benz cars (cf. Tab. 1). C212_A, C212_B, and C212_C indicate different body styles of 
the same car line. Those impressive numbers additionally provide an invaluable aid in arguing for formal 
methods use in large configuration settings. Our non-CNF model counter has directly been applied to 
the non-normalized POFs whereas Cachet and c2d have been given Tseitin transformed (1°) CNF 
versions as input. Our comparison states that preserving formula structure by directly reasoning over 
the input without prior normalization pays off in scenarios featuring large input: Cachet timed out after 
6 hours on C212_C respectively exited abnormally on C212_B, c2d has been manually aborted (as it is 
lacking a timeout option) after 6 hours returning no results. 

In table 2 we summarize some computations we performed in order to analyze the number of valid 
orders including individual parts of the BOM, i.e. #5!Ar(POFAP(p)). Each row identifies a position 
variant (e.g. a certain device in the set of available radios), i.e. a part that might be installed at one single 
position in the car. As simultaneously installing different parts at the same position poses a physical 
impossibility, position variants need to be mutually exclusive in order to preserve product documentation 
consistency. Using table 2 one may compute inclusion probabilities for each part depending on the 
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Table 1: Computing the total number of orders 1 









ncnt 


Cachet 


c2d 


problem 


#vars 


#clauses 


result time 


result time 


result time 


C212_A 


4898 


10289 


5.9 x 10 1U 2.57 


5.9 x 10 1U 0.05 


5.9xl0 lu 51.82 


C212_B 


9574 


40809 


9.4 x 10 60 4709.51 


bus error — 


timeout — 


C212_C 


10637 


45990 


9.9 x 10 103 7915.00 


timeout — 


timeout — 



Table 2: Computing the number of orders including specific parts using ncnt^ 





C212J 


C212_C 


part p 


result 


time 


result 


time 


10 


3.6 x 10 3 ' 


5160.25 


4.1 x 10 iUU 


6528.81 


20 


2.7 x 10 58 


7686.87 


2.3 x 10 101 


7900.62 


30 


5.3 x 10 56 


4626.25 


5.5 x 10" 


7620.96 


40 


1.8 xlO 57 


5530.34 


1.9 xlO 100 


10933.19 


60 


6.0 x 10 56 


5607.98 


6.1 x 10" 


8251.84 


70 


6.0 x 10 55 


1833.41 


4.8 x 10 98 


3840.32 


999 


1.2 xlO 45 


111.87 


4.4 x 10 88 


136.41 



corresponding POF. One can clearly see that for the few variants we computed, variant 20 dominates the 
others. These numbers might serve as a basis in applications estimating future requirements in purchasing 
parts or for speeding up SAT solving. 

4.2.2 Rating Errors 

In practice it is highly desirable to rank the findings of an automated verifier. In the real world, nothing 
is perfect, and every corrective action has a price. If we have an error condition, which is represented 
by a formula (p being satisfiable, then #SAT(q>) may give an indication, how many cars are potentially 
affected by the error. 

As an example consider physically overlapping parts pi,pz- If, due to the POF, there is a con- 
structible order featuring both, p\ and pi, any assignment a with a |= POF AP(p\) f\P(p2) describes an 
erroneously constructible order. Thus computing #SAT (POF AP(pi) AP(pi)) will return the total num- 
ber of erroneously constructible orders wrt. p\ and p%. Using those numbers retrieved by model counting 
one may intuitively classify errors as follows: Overlaps leading to a high number of constructible orders 
(thus being more likely to actually occur in production) are intuitively more severe than overlaps fea- 
turing a negligible number. Experts concerned with fixing documentary flaws may thus prioritize their 
work using results produced by model counting. 



4.2.3 Measuring Documentation Quality 

Exact model counts also give rise to questions about the appropriate product documentation language 
and method. We are currently engaged in introducing symbolic verification methods to detect and help 
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avoid documentation errors. Knowing the exact model count gives some indication (at least on the level 
of management decisions) of the complexity of detecting errors. E.g. in order to prove that a part is no 
longer needed in a 2002 model line, it must be verified that none of the 10 24 cars needs the part. 

Our results furthermore indicate that model counting POFs might serve as a measure of product docu- 
mentation maturity (cf. Subsection 3.3): Model counting an early developer's version of C212_A resulted 
in 2.9 x 10 34 models whereas the version in productive use at Mercedes-Benz yields only 5.9 x 10 10 . 

5 Conclusion & Future Work 

In this paper we introduced new methods for the quantitative analysis of configuration formulas using 
model counting. We described methods for gaining additional information like the total number of valid 
configurations, the relative frequency of options in valid configurations, or the severity of errors. This 
information can be used to speed up future SAT solving processes of similar formulas, illustrate the 
complexity of the problem, or measuring the documentation quality. Using our non-CNF model counter 
ncnt we have been able to compute results for formulas whose CNF representations are too large for 
recent model counters to cope with. Our results give proof of the applicability of our newly introduced 
analysis methods to industrial-scale configuration problems. 

As our benchmarks show, model counting for large propositional formulas is still a quite time- 
consuming job to do. In d-DNNF compilation static heuristics based upon hypergraph decomposition 
have been successfully used to generate good variable orders for decomposing large and complex formu- 
las [12]. To our knowledge this approach has not yet been used in non-compilation based settings. Such 
heuristics should be introduced in DPLL-style model counting to speed up decomposition. 

Moreover, decomposing problems into independent connected components quite naturally suggests 
to harness multithreaded CPU architectures by parallelizing propositional model counting. 
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